Multi-agent deep reinforcement learning based resource management in SWIPT enabled cellular networks with H2H/M2M co-existence

نویسندگان

چکیده

Machine-to-Machine (M2M) communication is crucial in developing Internet of Things (IoT). As it well known that cellular networks have been considered as the primary infrastructure for M2M communications, there are several key issues to be addressed order deploy communications over networks. Notably, rapid growth traffic dramatically increases energy consumption, degrades performance existing Human-to-Human (H2H) traffic. Sustainable operation technology and resource management efficacious ways solving these issues. In this paper, we investigate a problem with H2H/M2M coexistence. First, considering energy-constrained nature machine type devices (MTCDs), propose novel network model enabled by simultaneous wireless information power transfer (SWIPT), which empowers MTCDs ability simultaneously perform harvesting (EH) decoding. Given diverse characteristics IoT devices, subdivide into critical tolerable types, further formulating an efficiency (EE) maximization under divers Quality-of-Service (QoS) constraints. Then, develop multi-agent deep reinforcement learning (DRL) based scheme solve problem. It provides optimal spectrum, transmit splitting (PS) ratio allocation policies, along efficient training designed behaviour-tracking state space common reward function. Finally, verify reasonable mechanism, multiple agents successfully work cooperatively distributed way, resulting outperforms other intelligence approaches terms convergence speed meeting EE QoS requirements.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-Agent Deep Reinforcement Learning

This work introduces a novel approach for solving reinforcement learning problems in multi-agent settings. We propose a state reformulation of multi-agent problems in R that allows the system state to be represented in an image-like fashion. We then apply deep reinforcement learning techniques with a convolution neural network as the Q-value function approximator to learn distributed multi-agen...

متن کامل

Learning to Communicate with Deep Multi-Agent Reinforcement Learning

We consider the problem of multiple agents sensing and acting in environments with the goal of maximising their shared utility. In these environments, agents must learn communication protocols in order to share information that is needed to solve the tasks. By embracing deep neural networks, we are able to demonstrate endto-end learning of protocols in complex environments inspired by communica...

متن کامل

Lenient Multi-Agent Deep Reinforcement Learning

Much of the success of single agent deep reinforcement learning (DRL) in recent years can be attributed to the use of experience replay memories (ERM), which allow Deep Q-Networks (DQNs) to be trained efficiently through sampling stored state transitions. However, care is required when using ERMs for multi-agent deep reinforcement learning (MA-DRL), as stored transitions can become outdated bec...

متن کامل

Accelerating Multi-agent Reinforcement Learning with Dynamic Co-learning

We introduce an approach to adaptively identify opportunities to periodically transfer experiences between agents in large-scale, stochastic, homogeneous, multi-agent systems. This algorithm operates in an on-line, distributed manner, using supervisor-directed transfer, leading to more rapid acquisition of appropriate policies in systems with a large number of cooperating reinforcement learning...

متن کامل

Multi-agent Reinforcement Learning in Network Management

This paper outlines research in progress intended to contribute to the autonomous management of networks, allowing policies to be dynamically adjusted and aligned to application directives according to the available resources. Many existing management approaches require static a priori policy deployment but our proposal goes one step further modifying initially deployed policies by learning fro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Ad hoc networks

سال: 2023

ISSN: ['1570-8705', '1570-8713']

DOI: https://doi.org/10.1016/j.adhoc.2023.103256